Goto

Collaborating Authors

 Brazil


Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code

Neural Information Processing Systems

In recent years, large language models (LLMs) have shown remarkable performance in many problems. However, they fail to plan reliably. Specialized attempts to improve their planning capabilities still produce incorrect plans and fail to generalize to larger tasks. Furthermore, LLMs designed for explicit "reasoning" fail to compete with automated planners while increasing computational costs, which reduces one of the advantages of using LLMs. In this paper, we show how to use LLMs to always generate correct plans, even for out-of-distribution tasks of increasing size.


Simple and Effective Specialized Representations for Fair Classifiers

Neural Information Processing Systems

Fair classification is a critical challenge that has gained increasing importance due to international regulations and its growing use in high-stakes decision-making settings. Existing methods often rely on adversarial learning or distribution matching across sensitive groups; however, adversarial learning can be unstable, and distribution matching can be computationally intensive. To address these limitations, we propose a novel approach based on the characteristic function distance. Our method ensures that the learned representation contains minimal sensitive information while maintaining high effectiveness for downstream tasks. By utilizing characteristic functions, we achieve a more stable and efficient solution compared to traditional methods. Additionally, we introduce a simple relaxation of the objective function that guarantees fairness in common classification models with no performance degradation. Experimental results on benchmark datasets demonstrate that our approach consistently matches or achieves better fairness and predictive accuracy than existing methods. Moreover, our method maintains robustness and computational efficiency, making it a practical solution for real-world applications.


Representation Gap: Explaining the Unreasonable Effectiveness of Neural Networks from a Geometric Perspective

arXiv.org Machine Learning

Characterizing precisely the asymptotic generalization error of neural networks using parameters that can be estimated efficiently is a crucial problem in machine learning, which relies heavily on heuristics and practitioners' intuition to make key design choices. In order to mitigate this issue, we introduce the Representation Gap, a metric closely related to the generalization error, but admitting better-behaved asymptotic dynamics. Focusing on equivariant diffusion models and leveraging results from optimal quantization and point-process theory, we derive a precise asymptotic equivalent of the Representation Gap and show that it is governed by a single parameter, the \textit{intrinsic dimension} of the task, which is easy to interpret, efficient to estimate, and can be linked to the equivariances of common neural network architectures. We show that this asymptotic dynamic also extends to a broader range of tasks and training algorithms. Finally, we demonstrate empirically that our asymptotic law and intrinsic dimension estimation are accurate on a wide range of synthetic datasets, where these quantities are known, as well as on more realistic datasets, where we obtain results consistent with the related literature.


Air France and Airbus found guilty of manslaughter over 2009 plane crash

BBC News

Air France and Airbus have been found guilty of manslaughter over a 2009 plane crash which killed 228 people. The Paris Appeals Court found the airline and aircraft manufacturer guilty of corporate manslaughter over the incident, in which flight AF447 between Rio de Janeiro and Paris crashed into the Atlantic Ocean. The passenger jet stalled during a storm and plunged into the water, killing all on board. A court had previously cleared the companies in April 2023 but they were found guilty after this appeal. The Airbus A330 vanished from radars during a storm, with its wreckage found after a long search of 10,000 sq km (3,860 sq miles) of sea floor.


Forecasting Oncology Demand Trends with Boosting-Based Bayesian Conjugate Models

arXiv.org Machine Learning

Accurate trend forecasting in healthcare time series is essential for planning and resource allocation. This paper proposes a Bayesian framework for predicting oncology demand trends, modeling weekly appointments as a Poisson process with a Gamma prior to the demand rate. To enhance adaptability and capture persistent directional patterns, we incorporate a residual-based boosting mechanism grounded in a Gamma-Log-Normal conjugate structure. This boosting approach allows the model to track both short- and long-term trend shifts while maintaining the analytical tractability of conjugate Bayesian updating. The methodology was evaluated on real oncology service data from Cariri, Ceara, Brazil, and compared against established baselines, including linear regression, ARIMA, naive forecasting, LSTM neural networks, and XGBoost. Results showed that the proposed model outperforms competing methods in trend detection accuracy, with gains in terms of percentage of correct direction of 38.25% in relation to the second best approach in some cases.


Large margin classifier with graph-based adaptive regularization

arXiv.org Machine Learning

This paper introduces the use of per-class regularization hyperparameters in Gabriel graph-based binary classifiers. We demonstrate how the quality index used for regularization behaves both in the margin region and in the presence of outliers, and how incorporating this regularization flexibility can lead to solutions that effectively eliminate outliers while training the classifier. We also show how it can address class imbalance by generating higher and lower thresholds for the majority and minority classes, respectively. Thus, rather than having a single solution based on fixed thresholds, flexible thresholds expand the solution space and can be optimized through hyperparameter tuning algorithms. Friedman test shows that flexible thresholds are capable of improving Gabriel graph-based classifiers.


Gromov-Wasserstein Methods for Multi-View Relational Embedding and Clustering

arXiv.org Machine Learning

Learning low-dimensional representations from multi-view relational data is challenging when underlying geometries differ across views. We propose Bary-GWMDS, a Gromov-Wasserstein-based method that operates directly on distance matrices to learn a consensus embedding preserving shared relational structure. By leveraging intrinsic distances, the approach naturally handles nonlinear distortions across views. We also introduce Mean-GWMDS-C, a clustering-oriented formulation that averages distance matrices and learns reduced-support representations via a consensus Gromov-Wasserstein transport. Experiments on synthetic and real-world datasets show that the proposed framework yields stable and geometrically meaningful embeddings.


China car giant BYD says it can thrive without US

BBC News

The recent surge in fuel prices due to the war in Iran has spurred demand for electric vehicles around the world, and Chinese car makers are making the most of the opportunity. China is the world's top producer of EVs, and while its manufacturers remain largely shut out of the major car market of the United States, they are benefiting from an uptick in interest and orders via dealerships across Asia and elsewhere. BYD, which overtook Tesla as the world's largest seller of electric vehicles last year and is expanding aggressively overseas, is at the centre of this shift in focus. We survive and are successful without the US market today, BYD executive vice president Stella Li told the BBC at the Beijing Auto Show. Instead of aiming for US customers, the company says its challenge is meeting increased demand in other regions, including Brazil, the UK and Europe.


What does the data tell us about immigration in Wales? Search for your area

BBC News

What does the data tell us about immigration in Wales? Like many countries, Wales sees a steady flow of people arriving and leaving for other countries each year. The difference between those arriving and those leaving is known as net migration. Focusing on people moving from abroad, latest estimates say Wales' population - which was 3.2 million in June 2024 - had increased by about 23,000 over the previous year as a result of net international migration. A recent YouGov poll found a quarter of people surveyed in Wales believed that immigration, alongside the economy, should be among the issues prioritised by the Welsh government, even though immigration is controlled by the UK government.


Steve Rosenberg: Kremlin's tightening grip on internet fuels public discontent

BBC News

Near the Kremlin several dozen people are queuing outside the presidential administration office. They've come to submit petitions calling on President Vladimir Putin to end a crackdown on the internet. Russian authorities have been tightening control of the country's cyber space. Access to global messaging apps has been restricted and there are widespread disruptions to, even shutdowns of, mobile internet. Petitioning the president is legal.